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Abstract. The spread of influence in social networks is studied in two 
main categories: the progressive model and the non-progressive model 
(see e.g. the seminal work of Kempe, Kleinberg, and Tardos in KDD 
2003) . While the progressive models are suitable for modeling the spread 
of influence in monopolistic settings, non-progressive are more appropri- 
ate for modeling non-monopolistic settings, e.g., modeling difl'usion of 
two competing technologies over a social network. Despite the extensive 
work on the progressive model, non-progressive models have not been 
studied well. In this paper, we study the spread of influence in the non- 
progressive model under the strict majority threshold: given a graph G 
with a set of initially infected nodes, each node gets infected at time r 
iff a majority of its neighbors are infected at time r — 1. Our goal in the 
MinPTS problem is to find a minimum-cardinality initial set of infected 
nodes that would eventually converge to a steady state where all nodes 
of G are infected. 

We prove that while the MinPTS is NP-hard for a restricted family of 
graphs, it admits an improved constant-factor approximation algorithm 
for power-law graphs. We do so by proving lower and upper bounds 
in terms of the minimum and maximum degree of nodes in the graph. 
The upper bound is achieved in turn by applying a natural greedy algo- 
rithm. Our experimental evaluation of the greedy algorithm also shows 
its superior performance compared to other algorithms for a set of real- 
world graphs as well as the random power-law graphs. Finally, we study 
the convergence properties of these algorithms and show that the non- 
progressive model converges in at most 0{\E{G)\) steps. 
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1 Introduction 

Studying the spread of social influence over in networks under various propaga- 
tion models is a central issue in social network analvsis fi9\ ? J3Q|36j . This issue 
plays an important role in several real-world applications including viral mar- 
keting 16114133126] . As categorized by Kempe et al. [26], there are two main 
types of influence propagation models: the progressive and the non-progressive 
models. In progressive models, infected (or influenced) vertices will remain in- 
fected forever, but in the non-progressive model, under some conditions, infected 
vertices may become uninfected again. In the context of viral marketing and 
diffusion of technologies over social networks, the progressive model captures 
the monopolistic settings where one new service is propagated among nodes of 
the social network. On the other hand, in non-monopolistic settings, multiple 
service providers might be competing to get people adopting their services, and 
thus users may switch among two or more services back and forth. As a result, in 
these non-monopolistic settings, the more relevant model to capture the spread 
of influence is the non-progressive model |23|5|15|2'5] . 

While the progressive model has been studied extensively in the literature 
|26|34|21|4|9)lQ)lT] . the non-progressive model has not received much attention 
in the literature. In this paper, we study the non-progressive influence models, 
and report both theoretical and experimental results for our models. We focus 
on the the strict majority propagation rule in which the state of each vertex at 
time r is determined by the state of the majority of its neighbors at time r — 1. 
As an application of this propagation model, consider two competing technolo- 
gies (e.g. IM service) that are competing in attracting nodes of a social network 
to adopt their service, and nodes tend to adopt a service that the majority of 
their neighbors already adopted. This type of influence propagation process can 
be captured by applying the strict majority rule. Moreover, as an illustrative ex- 
ample of the linear threshold model ^26], the strict majority propagation model 
is suitable for modeling transient faults in fault tolerant systems |17|31|18] . and 
also used in verifying convergence of consensus problems on social networks [29] . 
Here we study the non-progressive influence models under the strict majority 
rule. In particular, we are mainly interested in minimum perfect target set prob- 
lem where the goal is to identify a target set of nodes to infect at the beginning 
of the process so that all nodes get infected at the end of the process. We will 
present approximation algorithms and hardness results for the problem as well 
experimental evaluation of our results. As our main contributions, we provide 
improved upper and lower bounds on the size of the minimum perfect target set, 
which in turn, result in improved constant-factor approximations for power-law 
graphs. Finally, we also study the convergence rate of our algorithms and re- 
port preliminary results. Before stating our results, we define the problems and 
models formally. 

Problem Formulations. Consider a graph G{V,E). Let N{v) denote the set 
of neighbors of vertex v, and d{v) = \N{v)\. Also, let A{G) and S{G) denote 
the maximum and minimum degree of vertices in G respectively. The induced 
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subgraph of G with a vertex set S C V{G) is denoted by G[S]. Also ds(v) 
denotes the number of neighbors of v in subset S. 

A 0/1 initial assignment is a function /o : V{G) {0, 1}. For any 0/1 initial 
assignment /o, let fr ' V{G) {0,1} ^ 1) be the state of vertices at time 
r and t{v) be the threshold associated with vertex v. For the strict majority 
model, the threshold t{v) = [^^^^^] for each vertex v. 

In the non-progressive strict majority model: 



, ,._jOiiEueNiv)fr-liu)<t{v) 



/t-i(u) > t{v) . 

In progressive strict majority model: 



fr{v) 



(Oiffr-l{v 
llif /r-l(^ 



and EnGiV(^) fr-i{u) < t{v) 



Strict majority model is related to the linear threshold model in which t{v) is 
chosen at random and not necessarily equal to [^^^^^]. 

AO/1 initial assignment /o is called a perfect target set (PTS) if for a finite 
fri^) = 1 for all V e V{G)^ i.e., the influence will converge to a steady state of 
all I's. The cost of a target set /o, denoted by cost(/o), is the number of vertices 
V with fo{v) = 1. The minimum perfect target set (MinPTS) problem is to find a 
perfect target set with minimum cost. The cost of this minimum PTS is denoted 
by PPTSiG) and NPPTS{G) respectively for progressive and non-progressive 
models. This problem is also called target set selection [T]. Another variant of 
this problem is the maximum active set problem [Ij where the goal is to find at 
most k nodes to activate (or infect) at time zero such that the number of finally 
infected vertices is maximized. 

A graph is power-law if and only if its degree distribution follows a power-law 
distribution asymptotically. That is, the fraction P{x) of nodes in the network 
having degree x goes for large number of nodes as P{x) = ax~^ where a is a 
constant and 7 > 1 is called power-law coefficient. It is widely observed that 
most social networks are power-law [12 . 

Our Results and Techniques. In this paper, we study the spread of influence 
in the non-progressive model under the strict majority threshold. We present 
approximation algorithms and hardness results for the problem as well exper- 
imental evaluation of our results. As our main contributions, we provide im- 
proved upper and lower bounds on the size of the minimum perfect target set, 
which in turn, result in improved constant-factor approximations for power- 
law graphs. In addition, we prove that the MinPTS problem (or computing 
NPPTS{G)) is NP-hard for a restricted family of graphs. In particular, we 
prove lower and upper bounds on NPPTS{G) in terms of the minimum degree 
{S{G)) and maximum degree (zA(G)) of nodes in the graph, i.e., we show that 
< NPPTSiG) < 4^(GHlli?i)SffG)-2) - The proofs of these bounds 
are combinatorial and start by observing that in order to bound NPPTS{G) 
for general graphs, one can bound it for bipartite graphs. The upper bound is 
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achieved in turn by applying a natural greedy algorithm which can be easily 
implemented. Our experimental evaluation of the greedy algorithm also shows 
its superior performance compared to other algorithms for a set of real-world 
graphs as well as the random power-law graphs. Finally, we study the conver- 
gence properties of these algorithms. We first observe that the process will always 
converges to a fixed point or a cycle of size two. Then we focus on the conver- 
gence time and prove that for a given graph G, it takes at most 0{\E{G)\) rounds 
for the process to converge. We also evaluate the convergence rate of the non- 
progressive influence models on some real- world social networks, and report the 
average convergence time for a randomly chosen set of initially infected nodes. 

More Related Work. The non-progressive spread of influence under the strict 
majority rule is related to the diffusion of two or more competing technologies 
over a social network [ 23|5|15|25 . As an example, an active line of research in 
economics and mathematical sociology is concerned with modeling these types of 
diffusion processes as a coordination game played on a social network |23|5|15|25] . 
Note that none of these previous prior work provide a bound for the perfect target 
set problem. 

It has been brought to our attention that in a relevant unpublished work by 
Chang [7 , the MinPTS problem on pawer-law graphs is studied and the bound 
of NPPTS{G) = 0([27^1) is proved under non-progressive majority models 
in a power-law graph. But his results do not practically provide any bound for 
the strict majority model. We will show that our upper bound is better and 
practically applicable for different amounts of 7 under strict majority threshold. 

Tight or nearly tight bounds on the PPTSiG) are known for special types 
of graphs such as torus, hypercube, butterfly and chordal rings |16|17|27|31|32j . 
The best bounds for progressive strict majority model in general graphs are due 
to Chang and Lyuu. In [9 , they showed that for a directed graph G, PPTSiG) < 
P \V{G) I . In [8 , they improved their upper bound to | \V{G) \ for directed graphs 

and ^^^2^^^ for undirected graphs. However, to the best of our knowledge, there 
is no known bound for NPPTS{G) for any type of graphs. In this paper, we will 
combinatorially prove that < NPPTS{G) < 4^(GH(SGm)WG)-2) ■ 

It is known that the Target Set Selection problem and Maximum Active Set 
problem are both NP-hard in the linear threshold model [26] , and approximation 
algorithms have been developed for these problems. Kempe et al. [26] and Mossel 
and Roch [28 present a (1 — ^)-approximation algorithm for the maximum active 
set problem by showing that the set of finally influenced vertices as a function 
of the originally influenced nodes is submodular. On the other hand, it has been 
shown that the target set selection problem is not approximable for different 
propagation models |21|4|8|1Q] . The inapproximability result of Chang and Lyuu 
in [8 on progressive strict majority threshold model is the most relevant result 
to our results. They show that unless NP C TIME{n^^^^^^'^'^)^ no polynomial 
time ((1/2 — e) In | V|)-approximation algorithm exists for computing PPTSiG). 
To the best of our knowledge, no complexity theoretic results have been obtained 
for the non-progressive models. 
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The problem of maximizing social influence for specific family of graphs has 
been studied under the name of dynamic monopolies in the combinatorics liter- 
ature [16 1 7|27|31|32|8|l|7j . All these results are for the progressive model. The 
optimization problems related to the non-progressive influence models are not 
well-studied in the literature. The one result in the area is due to Kempe et al. 
[26] who presented a general reduction from non-progressive models to progres- 
sive models. Their reduction, however, is not applicable to the perfect target set 
selection problem. 



2 Non-Progressive Spread of Influence in General Graphs 

In this section, we prove lower bound and upper bounds for minimum PTS in 
graphs, and finally show that finding the minimum PTS in general graphs is 
NP-Hard. 

Lower bound. The following theorem shows that if we have some lower bound 
and upper bound for minimum Perfect Target Set in bipartite graphs then these 
bounds could be generalized to all graphs ( Theorem [T]). 

Lemma 1. If a\V{H)\ < NPPTS{H) < I3\V{H)\ for every bipartite graph H 
under strict majority threshold, then a\V{G)\ < NPPTSiG) < /3|V(G)| under 
strict majority threshold for every graph G (see appendix\A^. 

The following lemma shows characteristics of PTSs in some special cases. 
These will be used in proof of our theorems. 

Lemma 2. Consider the non-progressive model and let G = (X^Y) be a bipar- 
tite graph and /o be a perfect target set under strict majority threshold. For every 
S C V{G) if ^y^snx fo{^) = or ^^^snv foi^) = 0; then there exists at least 
one vertex u in S such that ds{u) < d{u) — t{u) (see appendix\A^. 

If the conditions of previous lemma holds, we can obtain an upper bound for 
number of edges of the graph. Following lemma provides this upper bound. 
This will help us finding a lower bound for NPPTS of graphs. The function 
t : V{G) N may be any arbitrary function but here it is interpreted as the 
threshold function. 

Lemma 3. Consider a graph G with n vertices. If for every S C V{G) there 
exists at least one vertex v for which ds{v) < d{v) — t{v), then \E{G)\ < 
EnGy(G)(^H -^(^)) (^^^ appendix^. 

The following theorem shows that for every bipartite graph G, NPPTSiG) > 
A^cyli ' Lemma jlj generalizes this theorem to all graphs. Also, TheoreniH shows 
that this bound is tight. In the following, the induced subgraph of Gwith a 
vertex set S C V{G) is denoted by G[S]. 

Theorem 1. For every bipartite graph G = {X^Y) of order n, NPPTS{G) > 

2n 
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Proof. Let /o be an arbitrary PTS for G. Partition the vertices of graph G into 
three subsets Bx^ By and W as follow. 

Bx = {veX\f^{v) = l} 
BY = {veY\h{v) = l} 
W = {veV{G)\h{v)=0} 

Consider the induced subgraph of G with vertex set Bx U W and suppose that 
S C Bx^-^W . For every vertex v G Ffl^, we have /o('^) = 0. So By Lemma|2j for 
every S C Bx U W there is at least one vertex u such that dsiu) < d{u) — t{u). 
By Lemma I3I this implies that G[Bx U W] has at most ^ueBx^w^^^^) ~ ^('^)) 
edges. Similarly we can prove that G[5yLJVK] has at most '^ueBY^w(^(^)~'^(^)) 
edges. Let ew be the number of edges in G[W^], ewx be the number of edges 
with one end point in Bx and the other end point in W and ewv be the number 
of edges with one end point in By and the other end point in W . we have: 

ewx -^ew < ^ {d{v) - t{v)) 

veBx^W 

ewY ^ew < ^ {d{v) - t{v)) 
veBY^iW 



and so, 

veviG) vew 
The total degree of vertices in W is X^^gvf ^i^) ~ ^wx + ^wy + ^ew Thus 

Y.d{v)< J2 {d{v)-t{v))+Y,id{v)-t{v)) 

vew vev{G) vew 

If we denote the set of vertices for which /o is equal to 1 by 5, we have 

^{2t{v)-d{v))<Y,{d{v)-t{v)) (1) 

vew veB 



For every vertex t{v) > ^M+l^ go 

veB 

And the proof is complete. 



We now show that the bound in Theorem |4] is tight. 
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Algorithm 1 Greedy NPPTS 

sort the vertices in G in ascending order of their degrees as the sequence t'l , . . . , Vi 
for i — 1 to n do 

whiteadj [vi] — 

blocked[^'^] = 
end for 

for i = 1 to n do 

for each u ^ N{vi) do 

if whiteadj [u] = d{u) — t{u) then 

blocked[t'i] = 1 
end if 
end for 

if (blocked[vi] = 1) then 

Mv) = 1 

else 

Mv) = 

for each u ^ N{vi) do 

whiteadj = 1 
end for 
end if 
end for 



Lemma 4. For infinitely many n^s, there exists a 2d ^ 1-regular graph with n 
vertices such that NPPTS{G) = under strict majority rule (see appendix 

HP- 

Upper bound. In this section, we present a greedy algorithm which gives an 
upper bound for NPPTS{G). 

Theorem 2. For every graph G of order n, NPPTS{G) < 4/\^(^^_f^)^^_2) 
der strict majority threshold. 

Algorithm [l] guarantees this upper bound. This algorithm gets a graph G of 
order n and the threshold function t as input and determines the values of /o 
for each vertex. 

Lemma 5. The algorithm Greedy NPPTS finds a Perfect Target Set for non- 
progressive spread of influence. 

Proof. By induction on the number of vertices for which /o is determined, we 
prove that /o remains a PTS after each step of algorithm if we assume that /o 
is 1 for undetermined values. It is clear that the claim is true at the beginning. 
Consider a set of values of /o which forms a PTS and let v be a vertex for 
which value of fo{v) is set to by the algorithm in the next step. By induction 
hypothesis, /o is a PTS if fo{v) is assumed to be 1. According to the algorithm, 
fo{v) is set to iff the value of bio eked ['u] is zero i.e. no adjacent vertex of 
say ix, has exactly d{u) — t{u) adjacent initially uninfected vertices. So by setting 
/o('i;) to 0, each initially infected vertex w still has at least t{w) infected vertices 
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and also v has at least t{v) initially infected neighbors itself. Thus, after one step 
of propagation, all initially infected vertices plus v are infected and by induction 
hypothesis, all vertices will be infected eventually and so /o remains a PTS. 

Lemma 6. For every graph G of order n, Greedy NPPTS guarantees the upper 
hound of 4/\^(^^_fi)^^_2) /^^ NPPTS{G) under strict majority threshold where 
A and 6 are maximum and minimum degree of vertices respectively. 

Proof According to the algorithm, for each vertex the value of fo{v) is set to 
1 iff whiteadj [ix] = d{u) — t{u) for some u G N{v). Let S be the set of vertices 
u for which whiteadj [u] = d{u) — t{u). B and W denote the set of infected and 
uninfected vertices respectively. We have: 

J2{d{v) - t{v)) < d{v) =^ - 1) < E ^(^) 

ves vew ves vew 

Therefore, 

- 1)1^1 <Zi|H^|.^ 1^1 < ^11^1 



Each vertex in B has at least one adjacent vertex in S and each vertex v E S 
has at least d{v) —t{v) adjacent edges to W and so at most t{v) adjavent edges 
to 5, thus: 

\B\ < < + 1) < 2\S\ + J2 d{^) 

ves ves vew 

2A 4A + A(S - 2) 
<2\S\+ A\W\ < (2j-^ + A)\W\ < ^-\W\ 

Thus, 

A{S^2) 



\B\ < 



4Z\ + (zA + l)(5-2) 



The approximation factor of the algorithm follows from previous claim and 
the lower bound provided by Theorem [l] 

Corollary 1. The Greedy NPPTS algorithm is a 8zA+2(^+i)(^-2) ctpP'^oximation 
algorithm for NPPTS problem. 

NP-Hardness. In this section, we use a reduction from the Minimum Dominat- 
ing Set problem (MDS) |3] to prove the NP-hardness of computing NPPTS{G). 
The proof of following theorem is provided in appendix [A} 

Theorem 3. If there exists a polynomial-time algorithm for computing NPPTS (G) 
for a given graph G under the strict majority threshold, then P = NP. 
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3 Non-Progressive Spread of Influence in Power-law 
graphs 

In this section, we investigate the non-progressive spread of influence in power- 
law graphs, and show that the greedy algorithm presented in the previous section 
is indeed a constant-factor approximation algorithm for power-law graphs. For 
each natural number x, we assume that the number of vertices with degree x is 
proportional to x~^ and use a as the normalization coefficient. The value of 7, 
known as power-law coefficient, is known to be between 2 and 3 in real- world 
social networks . We denote the number of vertices of degree x by P{x) = ax~^ . 
Let n be the number of vertices of graph, so we have: 

00 

n = > ax~^ = aCM =^ a = 

ti C(7) 

where is the Riemann Zeta function [24] . 

Lower bound. Consider a power-law graph G with a threshold function t and 
a perfect target set /q. Denoting the set of initially influenced vertices by B and 
the rest of the vertices by VK, from the Equation [T] we have: 



Y,{2t{v)-d{v))<Y,{d{v)-t{v)). 



vew veB 

The maximum cardinality of W is achieved when the degree of all vertices in B 
is greater than or equal to the degree of all vertices in W. In this case, assume 
that the minimum degree of vertices in 5 is A: and < p < 1 is the proportion 
of the vertices of degree k in 5, so under strict majority threshold we have: 

k-1 

ax-^ + (1 - p)ak-^ <\W\< {2t{v) - d{v)) 
x=i vew 

x-1^ 



veB x=k-\-l 



x=l 

=^C(7)-C(7,^-1) + (1-P)^"^ 

^ ch-hk)-a^^k)^pk-^k-i) 

2 

By estimating the value of Riemann Zeta function, we can estimate the upper 
bound of k and lower bound of p for that k to provide a lower bound for \B\. 
Assuming that we have the maximum possible value of k and minimum value of 
p for that /c, then: 



\B\ > > ax ^ ^ apk ^ = -— — n. 

x=k+l ^^^^ 
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The estimated values of lower bound for 2 < 7 < 2.8 is shown in Figure [l] 
Upper bound Suppose that one has run Greedy NPPTS algorithm under strict 
majority threshold on a graph with power-law degree distribution. The follow- 
ing theorem shows that unlike general graphs, the Greedy NPPTS algorithm 
guarantees a constant factor upper bound on power-law graphs. 

Theorem 4. Algorithm Greedy NPPTS initially influences at most (1 + — 
2^^)n vertices under the strict majority threshold on a power-law graphs of 
order n. 

Proof. We may assume that the input graph is connected. We prove that the 
number of uninfected vertices of degree 1 are sufficient for this upper bound. 
Let V be a vertex of degree more than 1 with k adjacent vertices of degree 1 say 
ui^U2 . . .Uk. If d{v) is odd, it is clear that at least | of the vertices Ui^U2 . . .u^ 
will be uninfected since k < d{v). Note that according to the greedy algorithm, 
the value of /o for degree 1 vertices are determined before any other vertex. If 
d{v) is even, at least | — 1 of vertices ui^U2 . . .Uk will be uninfected. Therefore 
we have: 



^ 00 
NPPTS{G) <n- -(P(l) - ^ P{2x)) 



x=l 



<n—-[a- ay , ) 



= n-f(l-^C(7)) = n(l + ^-^) 

By previous lemma, we conclude that the Greedy NPPTS algorithm is a 
constant-factor approximation algorithm on power-law graphs under strict ma- 
jority threshold. The lower bound and upper bound for different values of 7 
are shown in Figure [T] As you can see our algorithm acts optimally on social 
networks with large value of power-law coefficient since upper and lower bound 
diagram meet each other for these values of power-law coefficient. 



4 Experimental Evaluations 

In this section we run our algorithm on real-world social networks as well as 
random power-law graphs with a wide range of power-law coefficients. Following 
the method used in [26 , we compare the performance of our algorithm to other 
heuristics for identifying influential individuals. 

Random power-law and real- world networks. We evaluate the performance 
of the greedy algorithm on graphs with various amount of power-law coefficient. 
Following a previously developed way of generating power-law graphs from [2], 
we set two parameters a and 7 defined as follows: a is the logarithm of the 
graph size and 7 is the log-log growth rate (power-law coefficient). For details 
of the way to generate random power-law graphs, see Appendix [?]. We also 
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1.4 1.6 



Fig. 1. Values of upper bound and lower bound in power-law graphs 



run our algorithms over four social networks' data: Who-trusts-whom network 
of Epinions.com, Slashdot social network, collaboration network of Arxiv Astro 
Physics, Arxiv High Energy Physics paper citation network, Amazon product 
co-purchasing network. In cases where graph is not connected we select graphs' 
giant component. 

Setup. We compare our greedy algorithm with heuristics based on nodes' de- 
grees and centrality within the network, as well as the baseline of choosing ran- 
dom nodes to target. High-Degree and distance-centrality heuristics choose ver- 
tices in the order of decreasing degree and decreasing average distance to other 
nodes. These heuristics are commonly used in the social science literature as 
estimates of a node's influence in the social network [35"26'. 

In each of these cases, in each step, we check whether the selected vertices 
are a perfect target set or not. This can be easily verified by simulating spread 
of influence process until the states of vertices become stable. The simulation 
process ends at a polynomially bounded time r when for each v G V{G) we have 
f^{v) = fr-2{'^) (see Theorem [5] and Theorem |6|. 

Notice that because the optimization problem is NP-hard (Theorem [3|, and 
the testbed graphs are prohibitively large, we are not able to compute the opti- 
mum value to verify the actual quality of approximations. 



Experimental Results. Figure 2(a) shows the performance of our algorithm 
in comparison to introduced heuristics on random power-law graphs. For any 
value of 7 (power-law coefficient), all heuristics pick almost entire vertices of the 
graph while our algorithm pick a number of them between proved lower-bound 
and upper-bound. The same phenomena happens for the four real-world social 
networks data. The results are depicted in Figure |2(b)[ 



5 Convergence Issues 

Let the state graph of a non-progressive spread of influence process for graph 
G be as follows: Each node of this graph represents one of possible states of 
the graph. An edge between two states A and B in H models the fact that 
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(a) Results on the random power-law graphs (b) Results on the real network data 



applying one step of the influence process on state A changes the state to state 
B. First of all, one can easily see that the non-progressive model may not result 
to a singleton steady state for any dynamics. To see this, consider the following 
example: a cycle with 2k vertices C = viV2---^2/c and at time infect vertices 
with odd indices. In this case, the process will oscillate between exactly two 
states. In fact, one can show a general theorem that any dynamics will converge 
to either one or two states: 

Theorem 5. The non-progressive spread of influence process on a graph reaches 
a cycle of length of at most two (see appendix\D^. 

Using this intuition, one can define the convergence time of a non-progressive 
influence process under the strict majority rule as the time it takes to converge to 
a cycle of size of two states, i.e., the convergence time is the minimum time T at 
which /t('^) = /t+2('^) for all vertices v G V{G). For a set S of initially infected 
vertices, let ctciS) to be the convergence time of the non-progressive process 
under the strict majority model(T). In the following theorem, we formally prove 
an upper bound of 0{\E{G)\) for this convergence time: 

Theorem 6. For a given graph G and any set S C V{G), we have ctciS) = 

omG)\). 

Proof In each time step r of the non-progressive spread of influence, all the 
vertices apply the function f^ concurrently. In order to prove the theorem for 
such concurrent dynamics , we first define a simplified sequential dynamics^ prove 
the convergence time for this simplified dynamics, and finally give a reduction 
from the concurrent to the sequential dynamics. In sequential dynamics, the 
vertices apply the influence process one by one in a sequence of rounds, where 
in each step one vertex applies the influence process exactly once. 

We first show that the sequential dynamic on every graph G and under the 
strict majority model converges after at most 0{\E{G)\.\V{G)\) steps. To see this 
bound, consider the following potential function for a graph G: the number of 
edges whose endpoints have different states. One can see that whenever a vertex 
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changes its state from uninfected to infected the potential of G wih decrease 
at least one and otherwise it remains unchanged. Consider a vertex which has 
k state changes during the process until it final convergence. At least k/2 of 
these changes were from uninfected state to infected and so they cause one 
decrement in the potential function. The initial amount of G's potential is at 
most \E{G)\ and in each step (or consecutive steps), we have at least one 

state change. So after at most 2\E{G)\.\V {G)\ steps the potential of G would 
reach its minimum, and the proof for the sequential dynamics is complete. 

Now using the above observation, we show that the concurrent dynamics 
convergences fast. Consider graph H = (X, Y) built from G in Lemma [l] 
We show that for every concurrent dynamics in G with convergence time of 
T, there is an equivalent sequential dynamics in H with convergence time of 
c\V{G)\T for some constant c. This will prove etc ^ 0{\E{G)\)^ since we know 
that the convergence time of the sequential dynamic in graph H is at most 
2\V{H)\.\E{H)\ = S\V{G)\.\E{G)\ = c\V{G)\.T. So T e 0{\E{G)\). The main 
claim follows from the proof of Lemma [l] By induction on the number of steps, 
we can show that the state of vertices in G is equal to the state of vertices in 
X at odd steps and is equal to the state of vertices in Y at even steps (as we 
did in the proof of Lemma [T]) . Now order vertices of X and Y with numbers 
1,2, • • • , \V{G)\ and from \V{G)\ + 1 to 2|V(G)|. It is easy to see that the se- 
quential dynamics with this ordering, after |V(i^)| steps, has the same outcome 
under the concurrent dynamics in this graph. 

The above theorem is tight i.e. there exists a set of graphs and initial states 
with convergence time of 0{\E{G)\). In power-law graphs since average degree is 
constant, the number of edges is 0(|y|) and thus the convergence time of these 
graphs is 0(|F|). 

Finally, we study convergence time of non-progressive dynamics on several 
real- world graphs, and observe the fast convergence of such dynamics on those 
graphs. See Appendix [C] for details. 

6 Conclusions 

In this paper, we study the minimum target set selection problem in the non- 
progressive infiuence model under the strict majority rule and provide theoretical 
and practical results for this model. Our main results include upper bound and 
lower bounds for these graphs, hardness and approximation algorithm for this 
problem. We also apply our techniques on power-law graphs and derive improved 
constant-factor approximation algorithms for this kind of graphs. 

An important follow-up work is to study the minimum perfect set problem for 
non-progressive models under other infiuence propagation rules, e.g. the general 
linear threshold model. It is also interesting to design approximation algorithms 
for other special kinds of complex graphs such as small- world graphs. Another 
interesting research direction is to study maximum active set problem for non- 
progressive models. 
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A Proofs from Section [2] 

In this section we give the missing proofs from Section [2j 
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Proof of Lemma [1} Consider a graph G with n vertices and vertex set V{G) = 
{^1,^2, ' ' - Vn} and threshold function t. Assume that there is a Perfect Target 
Set /o for G such that cost{fo) < a\V{G)\. Let H = (X^Y) be a bipartite 
graph such that X = {xi, . . .Xn} and Y = . . .yn} and t' be the threshold 
function of vertices of H such that for every 1 < i < n, t'{xi) = t'{yi) = t{vi). 
Define E{H) = {xiyj\viVj G E{G)}. Let go be a Target Set for H such that 
go{xi) = go{yi) = foi^i) for every 1 < i < n. We claim that go is a PTS for H. 
By induction on r, we prove that gr{xi) = griyi) = fri^i) for every 1 < i < n. 
By the definition, the assertion is true for r = 0. Now let the assertion be true 
for time r. Consider a vertex Xi G X. We have ^y^j^^^.^ gr{y) = T^y^Nivi) fri"^) 
and also t{xi) = t{vi)^ thus Xi is influenced at time r + 1 by ^0 iff '^i is influenced 
at time r + 1 by /q. By similar justification we can show that gr-\-i{yi) = fr-\-i{vi) 
too. So go is a PTS for H iff fo is a PTS for G, which is a contradiction since 
by assumption NPPTS{H) > a\V{H)\ but cost{go) < a\V{H)\. 

Now we prove that NPPTS{G) < P\V{G)\. Consider the bipartite graph 
H with the aforementioned definition. By assumption there is a Perfect Target 
Set go with weight at most /3\V{H)\ for H. With no loss of generality assume 
that the number of vertices in X for which g^ is one (initially infected vertices) 
is less than the number of initially infected vertices of Y. Let /q be a PTS for 
G such that fo{vi) = g'oixi) for every 1 < i < n. We have cost{go) < P\V{G)\ 
since = 2|]/(G)|. By induction on r we show that f2r{^i) = 92ri^i) 

/2r+i('^0 = 92T+i{yi) every 1 < i < n and every r > 0. The assertion is 
trivial for r = 0. Now let the assertion be true for time 2r. Consider a vertex 
Vi e V{G). We have T^veNM /2r(^) = ExGiV(y.) ^2r(^) and also t{vi) = t{yi), 
thus Vi is influenced at time 2r + 1 by /q iff yi is influenced at time 2r + 1 by 
g'o. By similar justification we can show that /2r+2('^0 — ^2r+2(^i) ^oo- 9o i^ 
a PTS for H iff /(^ is a PTS for G and so NPPTS{G) < f3\V{G)\. 

In the following, ds{v) denotes the number of neighbors of v in subset. 

Proof of Lemma [2| Consider a set C V{G). With no loss of generality, 
suppose that /o('^) = for every v G S H X. We prove the lemma by contradic- 
tion. Assume that for every u e ds{u) > d{u) — t{u). For every y e S OY, 
fi{y) = since y has at least d{y) — t{y) + 1 adjacent vertices in 6* fl X for 
which fo is zero. Similarly, for every x e S D X ^ f2{x) = since x has at least 
d{x) — t{x) -\- 1 adjacent vertices in 5 fl F for which /i is zero, and so on. Thus 
fo is not a Perfect Target Set, a contradiction. 

Proof of Lemma[3[ We prove the lemma by induction on n. For n = 1 the asser- 
tion is trivial. Consider a graph G with n vertices. Let S = V{G). By assumption, 
there is at least one vertex such that d{v) < d{v) — t{v). Remove the vertex 
V from G. By induction hypothesis G — v has at most XlnGy(G-v)(^(^) ~ ^('^)) 
edges, so G has at most XlnGy(G)(^('^) ~ ^('^)) edges. 

Proof of Theorem[4| Consider a ((i+l)-regular graph Gi with mi vertices . In 
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step i {I < i < oo), Add m^+i = -^rrii vertices to the graph and connect each 
of them to Gi hy d -\- 1 edges. Each vertex of Gi must receive exactly d newly 
edges. Name the subgraph formed by these vertices G^+i. This process is shown 
in Figure [2J The final graph has n = Xli^i = mi{d-\- 1) vertices. It is easy to 
show that V{Gi) is a PTS, so NPPTS{G) < \V{Gi)\ = rm = ^ = J^. 




Proof of Theorem [sj In an instance of the minimum dominating set problem 
(MDS), given a graph G{V^E), our goal is to find find a subset 5* C V{G) of 
minimum cardinality such that for any node v ^ we have S fl N{v) ^ 0. We 
give reduction from this NP-hard problem to our problem. Given an instance G 
of MDS with V{G) = {i^i, 1^2, •••7 '^n} and \E{G)\ = e, we define an undirected 
graph H as follows (See Figure [3|: First, let 



^0 = {91^92} 
X2 = {bi\l < i < 2e + l} 
X4 = {^i|l <i<n} 
Xe = {di\l < i < 2e}. 



Xi = {ai\l < i < 2e + 1} 

= {ci\l <i<2e} 
X^ = {vi\l<i<n} 



Now let H{V, E) be 



V{H) = uto^^ 



On the Non-progressive Spread of Influence through Social Networks 



17 




Fig. 3. The graph H 



E{H) = {giai\l < i < 2e + 1} 
U {g2bi\l<i<2e^l} 
U {giCi\l<i<2e} 
U {g2Ci\l<i<2e} 

i—l i 

U {wiCj\l < i < n.'^diuk) <j< y^^d{uk)} 

k=l k=l 

U {viWj\uiUj G E{G) V i = j} 

i—l i 

U {vidj\l < i < n,y^^d{uk) <j< y^^d{uk)}. 



Suppose that D is a minimum dominating set for G. Define = {vi\ui G D}. 
We show that NPPTS{G) = 2e + n + 4 + |D|. It is easy to see that vertices in 
XoUX3UX4Ul^^ plus one vertex from each of Xi and X2 form a Perfect Target 
Set for the graph H. So, we have NPPTS{H) < |Xo| + IX3I + IX4I + + 2 = 
2e + n + 4+ \D\. 

It remains to prove that NPPTS{H) > 2e ^ n ^ 4 ^ \D\. Suppose that 
S C V{H) is a PTS for H with minimum cardinality. Consider vertex gi in time 
r. If fr{gi) = 0, in time r + 1 for every vertex G Xi we will have fr-\-i{ai) = 
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and then /r+2(5'i) = 0. So we have, gi G S. Similarly, we have ^2 ^ S. Moreover, 
at least 2e + 1 vertices from each of gi or neighbors must be in so w.l.o.g 
suppose that Xa's members plus at least one vertex from each of Xi and X2 are 
in S. By this setting, the vertices of Xq U Xi U X2 U X3 become infected and 
keep this infection for every r > 0. 

Consider a vertex ^ X4. Let B{wk) = {di G XQ\di is reachable from Wk 
by a path of length 2}. Suppose that Wk ^ ^S. If there exists a G B{wk) H 5, 
we replace it by Wk in 5. This modification does not prevent 5* from being a 
PTS and also does not increase So, we may assume that B{wk) f^ S — ^ 
when Wk ^ ^S*. Now, consider one of Wk^ neighbors in X5 such as Vp. None of 
Vp'^ neighbors in Xq are infected initially. Thus Vp has at most d{up) initially 
infected neighbors, this implies that fi{vp) = and it is true for all other Wk^s 
neighbors in X5. Similarly, f2{wk) = and f2{dj) = for all dj G B{wk)- 
Similar to this argument, one can show that for every r > 0, /2r('^/c) = and 
f2T{dj) = for all dj G B{wky Therefore, for every Wk G X4, at least one of its 
neighbors in X5 must be in S. This means that 5* fl X5 must have at least \D\ 
vertices and the proof is complete. 



B Experimental Evaluation Data on Real Networks 

Generating andom power-law networks. We evaluate the performance of 
the greedy algorithm on graphs with various amount of power-law coefficient. 
Following a previously developed way of generating power-law graphs from [2 , 
we set two parameters a and 7 defined as follows: a is the logarithm of the 
graph size and 7 is the log- log growth rate (power-law coefficient). The number 
of vertices with degree x, y satisfies 

log 7/ = a — 7logx. 

The random power-law graph model is defined as follows: given n weighted 
vertices with weights wi^ W2-, • • • , a pair (i, j) of vertices appears as an edge 
with probability WiWjp independently. These parameters p and wi^W2^ - • • 
must satisfy 

- W{i\wi = 1} = [e^^J -r and \\{i\wi = k} = [f^J for k = 2,3,.., [e^J. Here a 
is a value minimizing \n - Y.k'=i^ LfrJ I r = n- Y.k'=i LfrJ • 

One can easily see the expected degree of z'th vertex would be Wi and also 
vertices' weights follow power-law. 

Experimental results for four real- world networks. Table [l] includes the 
exact amount of greedy NPPTS's output compared to the output of other heuris- 
tics. 
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Table 1. Results on the real networks 



Network 


No. of 

nodes 


7 


No. of 

Greedy 


nodes select( 
High Degree 


3d by al^ 
Central 


^orithm 
Random 


Who-trusts-whom network of Epin- 
ions.com 


75888 


1.50 


27131 


75878 


75879 


75888 


Slashdot social network 


77360 


1.68 


49978 


77327 


77360 


77360 


Collaboration network of Arxiv As- 
tro Physics 


18772 


1.84 


8287 


18771 


18772 


18763 


Arxiv High Energy Physics paper 
citation network 


34546 


2.05 


14647 


34539 


34546 


34505 


Amazon product co-purchasing net- 
work 


262111 


2.54 


155085 


262111 


262005 


262026 



C Missed Things from Section [5] 

Proof of theorem [Hj In [20 , it is shown that, for a function A from {0, 1}^ 
to {0, 1}^ whose components from a symmetric set of threshold functions, the 
repeated application of Z\, leads either to a fixed point or to a cycle of length 
two. Since the set of functions (defined in Section [T]) are symmetric threshold 
functions, the lemma follows immediately from this fact. 

Average convergence time of the process on social networks. Applying 
a sampling technique and simple concentration inequalities, one can easily show 
that the average convergence time of the non-progressive process on graph G 
can be approximated with an additive error of e in time 0{ ^ -^^gs^^) ) where 
e= \E{G)\ andn= \V{G)\. 

Theorem 7. Computing the average convergence time of the non-progressive 
process on graph G, with an error of e is possible in time •niog(n) ^/^g^g 
e= \E{G)\ andn= \V{G)\. 

Proof Define random variable Xs = ctc^S). We uniformly select some of the 
y(G)'s subsets Si^ 82-, Sm and take the average of A^-.s. In |22j, Hoeffding 
shows that with large value of m and if A^-.s are bounded between and 6^, 
A^ would be a good estimation (with an error less than e) for ^[A^] that is our 
desired target: 

/ 2e^m^ 

Pr(\Xs-ms]\>e)<2e^V 



From Theorem |6] we know putting = and hi = 8e for all 1 < z < m, meets 
the preconditions of the above inequality. To have Pr{\Xs — E[A5']|) < we 
can set 

2 ln{n) YT=iipi - (^if _ M.m.e^.ln{n) 64e^/n(n) 
Since computing each A^. needs 0{n) the total time will be at most 0{mn) = 

^^ e^.nlog(n) ^ 
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Corollary 2. Computing the average convergence time of the process on a power- 
law graph G, with an error of e is possible in time ^^|^^^ ) where n = \V{G)\. 

As a result, we can perform experimental evaluation of convergence time in 
several families of graphs. In particular, through experimental evaluations, we 
show the average time of convergence for random power law graphs with e = 0.1. 
Figure [4] shows average convergence time calculated by sampling for 500 random 
power law graphs with average of 100 vertices. 




Fig. 4. Average convergence time on random power-law graphs 



